University of Hagen at GeoCLEF 2007: Exploring Location Indicators for Geographic Information Retrieval

نویسندگان

  • Johannes Leveling
  • Sven Hartrumpf
چکیده

Location indicators are text segments from which a geographic scope can be inferred, e.g. adjectives, demonyms (names for inhabitants of a place), geographic codes, orthographic variants, and abbreviations can be mapped to location names in one or more inferential steps. In this paper, the normalization of location indicators and treating morphology of location indicators for geographic information retrieval (GIR) within the system GIRSA (Geographic Information Retrieval by Semantic Annotation) are explored. Several retrieval experiments are performed on the German GeoCLEF 2007 data, including a baseline IR experiment on stemmed text (0.119 mean average precision, MAP). Results for this experiment are compared to results for experiments with normalized location indicators. Additionally, the latter approach was combined with an approach using semantic networks for retrieval (an extension of an experiment performed for GeoCLEF 2005). When using the topic title and description, the best performance was achieved by the combination of approaches (0.196 MAP); adding location names from the narrative part increased MAP to 0.258. Results indicate that 1) employing normalized location indicators improves MAP and increases the number of relevant documents found; 2) additional location names from the narrative increase MAP and recall, and 3) the semantic network approach has a high initial precision and even adds some relevant documents which were previously not found. For bilingual (English-German) experiments, queries were first translated into German before utilizing the translation as input to GIRSA. Performance for these experiments is generally lower, but reflect results for monolingual German. The baseline experiment (0.114 MAP) is clearly outperformed by all other experiments, achieving the best performance for a setup using title, description, and narrative (0.209 MAP).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

University of Hagen at GeoCLEF 2008: Combining IR and QA for Geographic Information Retrieval

This paper describes the participation of GIRSA at GeoCLEF 2008, the geographic information retrieval task at CLEF. GIRSA is a modified and improved variant of the system which participated at GeoCLEF 2007. It combines results retrieved with methods from information retrieval (IR) on geographically annotated data and question answering (QA) employing query decomposition. For the monolingual Ger...

متن کامل

Inferring Location Names for Geographic Information Retrieval

For the participation of GIRSA at the GeoCLEF 2007 task, two innovative features were introduced to the geographic information retrieval (GIR) system: identification and normalization of location indicators, i.e. text segments from which a geographic scope can be inferred, and the application of techniques from question answering. In an extension of a previously performed experiment, the latter...

متن کامل

GeoCLEF: the CLEF 2005 Cross-Language Geographic Information Retrieval Track

Introduction GeoCLEF is a new track for CLEF 2005. GeoCLEF was run as a pilot track to evaluate retrieval of multilingual documents with an emphasis on geographic search. Existing evaluation campaigns such as TREC and CLEF do not explicitly evaluate geographical relevance. The aim of GeoCLEF is to provide the necessary framework in which to evaluate GIR systems for search tasks involving both s...

متن کامل

Re-Ranking for Geo-Relevance With Non-Contextual Heuristics at GeoCLEF 2007

Geographic Information Retrieval (GIR) in an attempt to improve relevance by taking geographic information in textual documents into account. We describe out experiments carried out at the GeoCLEF 2007 evaluation [1] that investigate further the role of geo-filtering based re-ranking and query expansion with geographic terms. Our main findings are that manual query expansion with geo-terms is m...

متن کامل

GeoCLEF 2007: the CLEF 2007 Cross-Language Geographic Information Retrieval Track Overview

GeoCLEF ran as a regular track for the second time within the Cross Language Evaluation Forum (CLEF) 2007. The purpose of GeoCLEF is to test and evaluate cross-language geographic information retrieval (GIR): retrieval for topics with a geographic specification. GeoCLEF 2007 consisted of two sub tasks. A search task ran for the third time and a query classification task was organized for the fi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007